1.  INTRODUCTION


In  the  area  of  signal  processing,  it  is  of  interest  to  detect  the  number 
of  signals  in  presence  of  noise  and  estimate  the  parameters  of  the  signals. 

The problem of estimating the number of signals was discussed by Liggett (1973), Wax, Shan and Kailath (1984) and others in the literature within the framework of testing for the equality of the last few eigenvalues of the covariance matrix. The model considered by them involves expressing the observation vector as the sum of a Gaussian white noise vector and a vector of certain linear combinations of (random) signals radiated by sources. In this case, the number of signals is related to the multiplicity of the smallest eigenvalues of the covariance matrix of the observation vector. The problem of testing the hypothesis of the multiplicity of the smallest eigenvalues of the covariance matrix has been dealt with extensively in the multivariate statistical literature (e.g., see Anderson (1963), Krishnaiah (1976), and Rao (1983)). Wax and Kailath (1984) considered the problem of determination of the number of signals using information theoretic criteria proposed by Akaike (1972), Rissanen (1978) and Schwartz (1978).

In the present paper, we use an alternative information theoretic criterion for detection of the number of signals and establish its consistency. In Section 2 of the paper, we state briefly the problems considered in this paper. In Sections 3 and 4, we establish the consistency of our procedures when the variance of the white noise is unknown and known, respectively. In those sections, we assume that the distribution underlying the observations is complex multivariate normal. In Section 5, we establish the consistency of our procedure when the variance of the white noise is unknown and satisfies a certain condition and the underlying distribution is complex elliptically symmetric. The problem of detection of the number of signals when the noise covariance matrix is arbitrary is discussed by the authors in a forthcoming paper for the case when an independent estimate of that covariance matrix is available. In this case, the problem is equivalent to finding the multiplicity of the smallest eigenvalues of $\Sigma_1 \Sigma_2^{-1}$, where $\Sigma_1$ and $\Sigma_2$ respectively denote the covariance matrices of the observation vector and the noise vector. This problem is related to the problem of relationships between two covariance matrices studied by Rao (1983). In another forthcoming paper, we consider the problem of drawing inference on the rank of the regression matrix using information theoretic criteria similar to those used in the present paper.

Correction to "On Detection of Number of Signals in Presence of Noise" by L. C. Zhao, P. R. Krishnaiah and Z. D. Bai, CMA Technical Report 85-37

Page 11, lines 11 - 12: Replace

"But, for k > q, the above difference is not asymptotically positive with probability one"

with:

"But, for k > q, log L_k - log L_q is not distributed asymptotically chi-square".




2.  PRELIMINARIES  AND  STATEMENT  OF  PROBLEMS 


Consider the model

$$ x(t) = A s(t) + n(t) \tag{2.1} $$

where $A = [A(\Phi_1), \ldots, A(\Phi_q)]$, $s(t) = (s_1(t), \ldots, s_q(t))'$, $n(t) = (n_1(t), \ldots, n_p(t))'$ and $q < p$. In the above model, $n(t)$ is the noise vector, distributed independently of $s(t)$ as complex multivariate normal with mean vector $0$ and covariance matrix $\sigma^2 I_p$. Also, $s(t)$ is distributed as complex multivariate normal with mean vector $0$ and nonsingular covariance matrix $\Psi$, and $A(\Phi_i)$: $p \times 1$ is a complex vector of functions of the elements of the unknown vector $\Phi_i$ associated with the $i$-th signal. Also, $s_i(t)$ is the waveform associated with the $i$-th signal. Then, the covariance matrix $\Sigma$ of $x(t)$ is given by

$$ \Sigma = A \Psi A' + \sigma^2 I_p \tag{2.2} $$

where $A'$ denotes the transpose of the complex conjugate of $A$. We assume that $x(t_1), \ldots, x(t_N)$ are independent observations on $x(t)$. Now, let $\lambda_1 \ge \cdots \ge \lambda_p$ denote the eigenvalues of $\Sigma$, and let $\theta_1 \ge \cdots \ge \theta_q$ denote the nonzero eigenvalues of $A \Psi A'$. Also, let $H_q$ denote the hypothesis $\lambda_q > \lambda_{q+1} = \cdots = \lambda_p = \sigma^2$. Under $H_q$, $\lambda_i = \sigma^2 + \theta_i$ $(i = 1, 2, \ldots, q)$ and $\lambda_{q+j} = \sigma^2$ $(j = 1, 2, \ldots, p-q)$. So, $H_q$ is equivalent to the hypothesis that $q$ signals are transmitted. Various procedures (e.g., see Anderson (1963), and Krishnaiah and Waikar (1971, 1972)) are available in the literature for testing the hypothesis $H_q$ for a given value of $q$. Wax and Kailath (1984) used Akaike's AIC criterion and the Schwartz-Rissanen minimum description length (MDL) criterion for model selection to determine the value of $q$.
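As a quick numerical illustration of the eigenvalue structure implied by (2.2), the following sketch builds a population covariance matrix and checks that its $p - q$ smallest eigenvalues equal $\sigma^2$. The dimensions, the random seed and the particular $A$ and $\Psi$ are arbitrary assumptions made only for this example; they are not taken from the paper.

```python
import numpy as np


def population_covariance(A, Psi, sigma2):
    """Sigma = A Psi A^H + sigma^2 I, as in (2.2)."""
    p = A.shape[0]
    return A @ Psi @ A.conj().T + sigma2 * np.eye(p)


# Hypothetical example values (assumptions, not from the paper).
rng = np.random.default_rng(0)
p, q, sigma2 = 6, 2, 0.5

# A: p x q complex matrix, full column rank with probability one.
A = rng.standard_normal((p, q)) + 1j * rng.standard_normal((p, q))

# Psi: Hermitian positive definite q x q signal covariance.
B = rng.standard_normal((q, q)) + 1j * rng.standard_normal((q, q))
Psi = B @ B.conj().T + np.eye(q)

Sigma = population_covariance(A, Psi, sigma2)

# eigvalsh returns real eigenvalues in ascending order; reverse to descending.
eigvals = np.linalg.eigvalsh(Sigma)[::-1]
print("eigenvalues:", np.round(eigvals, 6))
```

The first $q$ printed eigenvalues exceed $\sigma^2$ and the remaining $p - q$ coincide with it up to rounding, which is the multiplicity that $H_q$ describes.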

According to the AIC criterion, the value of $q$ is estimated to be $\hat q$ where $\hat q$ is chosen such that

$$ \mathrm{AIC}(\hat q) = \min\{\mathrm{AIC}(0), \ldots, \mathrm{AIC}(p-1)\} \tag{2.3} $$

and

$$ \mathrm{AIC}(k) = -2 \log L_k + 2\nu(k,p). \tag{2.4} $$

Here $L_k$ is the likelihood ratio test statistic for testing $H_k$ against the alternative that $\Sigma$ is arbitrary, and $\nu(k,p)$ denotes the number of free parameters that have to be estimated under $H_k$. According to the MDL criterion, the value of $q$ is estimated as $\hat q$ where $\hat q$ is chosen such that

$$ \mathrm{MDL}(\hat q) = \min\{\mathrm{MDL}(0), \ldots, \mathrm{MDL}(p-1)\} \tag{2.5} $$

and

$$ \mathrm{MDL}(k) = -\log L_k + \tfrac{1}{2}\,\nu(k,p) \log N. \tag{2.6} $$

In the present paper, we consider the following alternative information theoretic criterion for model selection for estimation of the value of $q$. According to this new criterion, we estimate $q$ with $\hat q$ where $\hat q$ is chosen such that

$$ I(\hat q, C_N) = \min\{I(0, C_N), \ldots, I(p-1, C_N)\} \tag{2.7} $$

$$ I(k, C_N) = -\log L_k + C_N\, \nu(k,p) \tag{2.8} $$

and $C_N$ is chosen such that

$$ \lim_{N \to \infty} (C_N / N) = 0 \tag{2.9} $$

$$ \lim_{N \to \infty} (C_N / \log\log N) = \infty. \tag{2.10} $$
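The selection rule (2.7)-(2.8) can be sketched in a few lines. The sketch below is only illustrative: it assumes the Gaussian form of $\log L_k$ stated later in (3.8) and the parameter count $\nu(k,p) = k(2p-k)+1$ used in (3.9), and the sample eigenvalues and sample size are made-up numbers rather than data from the paper. Taking $C_N = 1$ gives the same minimiser as AIC, and $C_N = \frac{1}{2}\log N$ gives MDL.

```python
import math

import numpy as np


def log_lr(l, k, N):
    """log L_k = N * [sum_{i>k} log l_i - (p-k) * log(mean_{i>k} l_i)]."""
    tail = np.asarray(l, dtype=float)[k:]
    return N * (np.sum(np.log(tail)) - tail.size * math.log(tail.mean()))


def n_free(k, p):
    """Number of free parameters nu(k, p) = k(2p - k) + 1 (complex case)."""
    return k * (2 * p - k) + 1


def select_q(l, N, C_N):
    """Return (q_hat, scores) where scores[k] = I(k, C_N) for k = 0..p-1."""
    l = np.sort(np.asarray(l, dtype=float))[::-1]  # descending eigenvalues
    p = l.size
    scores = [-log_lr(l, k, N) + C_N * n_free(k, p) for k in range(p)]
    return int(np.argmin(scores)), scores


# Hypothetical sample eigenvalues: two clear signals above a noise floor near 1.
l = [10.0, 5.0, 1.02, 1.0, 0.99, 0.98]
N = 1000

q_aic, _ = select_q(l, N, C_N=1.0)                  # same argmin as AIC
q_mdl, _ = select_q(l, N, C_N=0.5 * math.log(N))    # MDL
q_log, scores = select_q(l, N, C_N=math.log(N))     # another admissible C_N
print(q_aic, q_mdl, q_log)
```

For these made-up eigenvalues all three penalties select two signals; the penalties differ only in how strongly they guard against overestimation.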

We are interested in establishing the strong consistency of the above procedure for the cases when $\sigma^2$ is unknown and known, under the assumption that the distribution underlying the data is complex multivariate normal. We are also interested in extending the above results to the situation when the underlying distribution is complex elliptically symmetric. The probability of correct detection of the procedure proposed by us is given by

$$ P(CD) = P[\, I(q, C_N) - I(k, C_N) < 0;\ k = 0, 1, \ldots, (p-1);\ k \ne q \mid H_q \,]. $$

The evaluation of $P(CD)$ remains to be investigated.
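Pending an analytical evaluation, $P(CD)$ can be approximated by simulation. The following is a minimal Monte Carlo sketch, not a method from the paper: it assumes complex Gaussian data, the Gaussian form of $\log L_k$ given later in (3.8), $\nu(k,p) = k(2p-k)+1$, and an arbitrary diagonal covariance, sample size, seed and number of trials chosen only for illustration.

```python
import math

import numpy as np


def log_lr(l, k, N):
    """Gaussian log likelihood ratio statistic log L_k from eigenvalues l."""
    tail = l[k:]
    return N * (np.sum(np.log(tail)) - tail.size * math.log(tail.mean()))


def select_q(l, N, C_N):
    """Minimise I(k, C_N) = -log L_k + C_N * (k(2p-k)+1) over k = 0..p-1."""
    p = l.size
    scores = [-log_lr(l, k, N) + C_N * (k * (2 * p - k) + 1) for k in range(p)]
    return int(np.argmin(scores))


def estimate_pcd(Sigma, q, N, C_N, trials, rng):
    """Fraction of simulated samples for which the criterion returns q."""
    p = Sigma.shape[0]
    root = np.linalg.cholesky(Sigma)
    hits = 0
    for _ in range(trials):
        # Standard complex normal entries with unit variance.
        z = (rng.standard_normal((p, N))
             + 1j * rng.standard_normal((p, N))) / math.sqrt(2.0)
        x = root @ z
        S = x @ x.conj().T / N                 # sample covariance matrix
        l = np.linalg.eigvalsh(S)[::-1]        # descending sample eigenvalues
        if select_q(l, N, C_N) == q:
            hits += 1
    return hits / trials


# Hypothetical setting: p = 4 sensors, q = 1 signal, sigma^2 = 1.
rng = np.random.default_rng(1)
Sigma = np.diag([5.0, 1.0, 1.0, 1.0])
N = 200
p_cd = estimate_pcd(Sigma, q=1, N=N, C_N=0.5 * math.log(N), trials=200, rng=rng)
print("estimated P(CD):", p_cd)
```

With a signal eigenvalue this far above the noise level the estimate should be close to one; weaker signals or smaller $N$ lower it.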


3.  CONSISTENCY OF THE $I(q, C_N)$ CRITERION WHEN $\sigma^2$ IS UNKNOWN
AND THE UNDERLYING DISTRIBUTION IS COMPLEX MULTIVARIATE NORMAL


In this section, we establish the consistency of the estimate $\hat q$ of $q$ when the criterion $I(q, C_N)$ is used and $\sigma^2$ is unknown. The main result of this section is stated in the following theorem:

THEOREM 3.1. Suppose $x(t)$ is a complex, stationary process with $E(x(t)) = 0$ and $E(x^*(t) x(t))^{2+\epsilon} < \infty$ for some $\epsilon > 0$. Also, we assume that $\{x(t_i),\ i = 1, 2, \ldots\}$ is a stationary and $\phi$-mixing sample sequence with $\phi$ being decreasing and $\sum_{n=1}^{\infty} \phi^{1/2}(n) < \infty$. Also, $\delta^2(u,v) > 0$ for $u, v = 1, 2, \ldots, p$, and $y_i = (y_{iuv}) = x(t_i) x'(t_i) - \Sigma$, where

[the display defining $\delta^2(u,v)$ is illegible in the source]  (3.1)

Let $\hat q$ be chosen such that

$$ I(\hat q, C_N) = \min\{I(0, C_N), \ldots, I(p-1, C_N)\} \tag{3.2} $$

where $I(k, C_N)$ was defined by (2.8) and $C_N$ is chosen satisfying (2.9) and (2.10). Then $\hat q$ is a strongly consistent estimate of $q$.

We  need  the  following  results  to  prove  the  above  theorem. 

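LEMMA 3.1. Suppose $\{x_i,\ i \ge 1\}$ is a stationary $\phi$-mixing sequence with $E(x_1) = 0$ and $E(|x_1|^{2+\epsilon}) < \infty$ for some $\epsilon > 0$. Also, $\phi$ is decreasing with $\sum_{n=1}^{\infty} \phi^{1/2}(n) < \infty$. Then

$$ \limsup_{N \to \infty} \Big\{ \sum_{i=1}^{N} x_i \Big/ (2N\delta^2 \log\log N\delta^2)^{1/2} \Big\} = 1 \quad \text{a.s.} \tag{3.2} $$

where $\delta^2 = E x_1^2 + 2 \sum_{i=1}^{\infty} E(x_1 x_{i+1}) \ne 0$ is assumed. Here, we note that $\sum_{n=1}^{\infty} \phi^{1/2}(n) < \infty$ implies $\sum_{i=1}^{\infty} |E(x_1 x_{i+1})| < \infty$.

For a proof of the above lemma, the reader is referred to Reznik (1968) or Stout (1974).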


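LEMMA 3.2. Suppose that $A$, $A_n$, $n = 1, 2, \ldots$, are all $p \times p$ symmetric matrices such that $A_n - A = O(\alpha_n)$ and $\alpha_n \to 0$ as $n \to \infty$. Denote by $\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_p$ and $\lambda_1^{(n)} \ge \cdots \ge \lambda_p^{(n)}$ the eigenvalues of $A$ and $A_n$ respectively. Then we have

$$ \lambda_i^{(n)} - \lambda_i = O(\alpha_n) \quad \text{as } n \to \infty, \quad i = 1, \ldots, p. $$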


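PROOF. Without loss of generality, we can assume $A = \mathrm{diag}[\tilde\lambda_1 I_{p_1}, \ldots, \tilde\lambda_r I_{p_r}]$, where $\tilde\lambda_1 > \tilde\lambda_2 > \cdots > \tilde\lambda_r$. According to Bai (1984), we know $\lambda_i^{(n)} - \lambda_i \to 0$.

At first we consider the special case where $r = 1$. For any $i$,

$$ 0 = |\lambda_i^{(n)} I_p - A_n| = |(\lambda_i^{(n)} - \tilde\lambda_1) I_p - (A_n - A)| = (\lambda_i^{(n)} - \tilde\lambda_1)^p + \sum_{l=1}^{p} (-1)^l (\lambda_i^{(n)} - \tilde\lambda_1)^{p-l} S_l \tag{3.3} $$

where $S_l$ is the sum of all principal minors of order $l$ of $A_n - A$. Since $A_n - A = O(\alpha_n)$, we have $S_l = O(\alpha_n^l)$. By (3.3) we know

$$ |\lambda_i^{(n)} - \tilde\lambda_1|^p \le \sum_{l=1}^{p} |\lambda_i^{(n)} - \tilde\lambda_1|^{p-l}\, O(\alpha_n^l) $$

which implies $|\lambda_i^{(n)} - \tilde\lambda_1| = O(\alpha_n)$ as $n \to \infty$ for $i = 1, \ldots, p$.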

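Now we consider the general case. Suppose $i \le p_1$. Partition $A_n - A = (B_{jk}^{(n)})$, $j, k = 1, 2$, where $B_{11}^{(n)}$ is of order $p_1 \times p_1$, and write $\tilde B_{22}^{(n)} = \lambda_i^{(n)} I_{p-p_1} - \mathrm{diag}[\tilde\lambda_2 I_{p_2}, \ldots, \tilde\lambda_r I_{p_r}] - B_{22}^{(n)}$. We have

$$ 0 = |\lambda_i^{(n)} I_p - A_n| = \begin{vmatrix} (\lambda_i^{(n)} - \tilde\lambda_1) I_{p_1} - B_{11}^{(n)} & -B_{12}^{(n)} \\ -B_{21}^{(n)} & \tilde B_{22}^{(n)} \end{vmatrix} = |\tilde B_{22}^{(n)}| \cdot |(\lambda_i^{(n)} - \tilde\lambda_1) I_{p_1} - B_{11}^{(n)} - B_{12}^{(n)} (\tilde B_{22}^{(n)})^{-1} B_{21}^{(n)}|. $$

Since $\tilde B_{22}^{(n)} \to \mathrm{diag}[(\tilde\lambda_1 - \tilde\lambda_2) I_{p_2}, \ldots, (\tilde\lambda_1 - \tilde\lambda_r) I_{p_r}]$, $\tilde B_{22}^{(n)}$ is nonsingular for all large $n$. Thus

$$ |(\lambda_i^{(n)} - \tilde\lambda_1) I_{p_1} - B_{11}^{(n)} - B_{12}^{(n)} (\tilde B_{22}^{(n)})^{-1} B_{21}^{(n)}| = 0. \tag{3.4} $$

From $A_n - A = O(\alpha_n)$, it follows that $B_{11}^{(n)} = O(\alpha_n)$ and $B_{12}^{(n)} (\tilde B_{22}^{(n)})^{-1} B_{21}^{(n)} = O(\alpha_n^2)$. Using the result proved just before, we get

$$ \lambda_i^{(n)} - \lambda_i = O(\alpha_n) $$

for $i = 1, \ldots, p_1$. By the same approach, we can prove

$$ \lambda_i^{(n)} - \lambda_i = O(\alpha_n), \quad i = p_1 + 1, \ldots, p, \tag{3.5} $$

which completes the proof of the lemma.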
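The conclusion of Lemma 3.2 is easy to check numerically. The sketch below perturbs a symmetric matrix with repeated eigenvalues by symmetric noise of decreasing size and compares the largest eigenvalue shift with the spectral norm of the perturbation. The bound used for comparison is Weyl's inequality, a standard result that is not the argument used in the proof above; the matrix, seed and noise scales are arbitrary assumptions.

```python
import numpy as np


def eig_desc(M):
    """Eigenvalues of a symmetric matrix in descending order."""
    return np.linalg.eigvalsh(M)[::-1]


rng = np.random.default_rng(2)
p = 5

# Symmetric A with repeated eigenvalues (3, 3, 1, 1, 1) in a random basis.
Q, _ = np.linalg.qr(rng.standard_normal((p, p)))
A = Q @ np.diag([3.0, 3.0, 1.0, 1.0, 1.0]) @ Q.T
A = (A + A.T) / 2.0  # remove rounding asymmetry

results = []
for alpha in [1e-1, 1e-2, 1e-3, 1e-4]:
    E = rng.standard_normal((p, p))
    E = alpha * (E + E.T) / 2.0                 # symmetric perturbation, O(alpha)
    shift = np.max(np.abs(eig_desc(A + E) - eig_desc(A)))
    bound = np.linalg.norm(E, 2)                # spectral norm of A_n - A
    results.append((alpha, shift, bound))
    print(f"alpha={alpha:.0e}  max shift={shift:.3e}  ||E||_2={bound:.3e}")
```

The printed shifts shrink in proportion to `alpha`, in line with $\lambda_i^{(n)} - \lambda_i = O(\alpha_n)$.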

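Let $l_1 \ge \cdots \ge l_p$ denote the eigenvalues of $\hat\Sigma$, where $N\hat\Sigma = \sum_{i=1}^{N} x(t_i) x'(t_i)$. Using Lemma 3.1 and the conditions imposed on $\{x(t_i),\ i = 1, 2, \ldots\}$, we have

$$ \hat\Sigma - \Sigma = O\big(\sqrt{\log\log N / N}\big) \quad \text{a.s.} \tag{3.6} $$

Now, applying Lemma 3.2, we obtain

$$ l_i - \lambda_i = O\big(\sqrt{\log\log N / N}\big) \quad \text{a.s.}, \quad i = 1, \ldots, p. \tag{3.7} $$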
When $x(t)$ is distributed as complex multivariate normal, the likelihood ratio test statistic for testing the hypothesis $H_k$ against the alternative that $\Sigma$ has general structure is known to be

$$ L_k = \Big\{ \prod_{i=k+1}^{p} l_i \Big/ \Big( \frac{1}{p-k} \sum_{i=k+1}^{p} l_i \Big)^{p-k} \Big\}^{N}. \tag{3.8} $$


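We will first prove the consistency of the method based upon the criterion $I(q, C_N)$ when $k < q$. Let $G_1(k) = \log L_k$ and

$$ G(k) = G_1(k) - C_N[k(2p-k) + 1] \tag{3.9} $$

where $k(2p-k)+1$ is the number of free parameters that have to be estimated under the hypothesis $H_k$ and $L_k$ is given by (3.8). Using (3.7), we get

$$ \lim_{N \to \infty} \frac{1}{N}\,(G_1(q) - G_1(k)) = W(q,k) \quad \text{a.s.} \tag{3.10} $$

where

$$ \begin{aligned} W(q,k) &= \log\Big(\prod_{i=q+1}^{p} \lambda_i\Big) - (p-q) \log\Big(\frac{1}{p-q} \sum_{i=q+1}^{p} \lambda_i\Big) - \log\Big(\prod_{i=k+1}^{p} \lambda_i\Big) + (p-k) \log\Big(\frac{1}{p-k} \sum_{i=k+1}^{p} \lambda_i\Big) \\ &= \Big\{ (q-k) \log A_1 - \sum_{i=k+1}^{q} \log \lambda_i \Big\} + (p-k)\big[\log(\alpha_1 A_1 + \alpha_2 A_2) - (\alpha_1 \log A_1 + \alpha_2 \log A_2)\big] \end{aligned} \tag{3.11} $$

with

$$ \alpha_1 = (q-k)/(p-k), \quad \alpha_2 = (p-q)/(p-k), \quad A_1 = \frac{1}{q-k} \sum_{i=k+1}^{q} \lambda_i, \quad A_2 = \frac{1}{p-q} \sum_{i=q+1}^{p} \lambda_i. $$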


By the well-known arithmetic mean-geometric mean inequality, we have

$$ W(q,k) \ge (p-k)\big[\log(\alpha_1 A_1 + \alpha_2 A_2) - (\alpha_1 \log A_1 + \alpha_2 \log A_2)\big]. \tag{3.12} $$

Also, $A_1 > A_2$. By Jensen's inequality, we have

$$ W(q,k) > 0. \tag{3.13} $$

Using (3.9), (3.10), (3.13) and $\lim_{N \to \infty} (C_N/N) = 0$, we obtain

$$ G(q) - G(k) = N\, W(q,k)(1 + o(1)) \quad \text{a.s.} $$

So, with probability one for large $N$, we have

$$ G(q) > G(k). \tag{3.14} $$
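The positivity of $W(q,k)$ for $k < q$ can be verified numerically for any eigenvalue configuration satisfying $H_q$. The sketch below evaluates $W(q,k)$ directly as the difference of the limits of $G_1(q)/N$ and $G_1(k)/N$, and again as the sum of an arithmetic-geometric mean term and a Jensen term; this splitting is one way to read (3.11) and is an assumption of the example, as are the population eigenvalues.

```python
import math

import numpy as np


def g_limit(lam, j):
    """Almost sure limit of G_1(j)/N: sum_{i>j} log lam_i - (p-j) log(mean)."""
    tail = lam[j:]
    return np.sum(np.log(tail)) - tail.size * math.log(tail.mean())


def W(lam, q, k):
    """W(q, k) as the limit of (G_1(q) - G_1(k)) / N."""
    lam = np.asarray(lam, dtype=float)
    return g_limit(lam, q) - g_limit(lam, k)


def W_decomposed(lam, q, k):
    """Split W(q, k) into an AM-GM term and a Jensen term."""
    lam = np.asarray(lam, dtype=float)
    p = lam.size
    a1, a2 = (q - k) / (p - k), (p - q) / (p - k)
    A1, A2 = lam[k:q].mean(), lam[q:].mean()
    amgm = (q - k) * math.log(A1) - np.sum(np.log(lam[k:q]))
    jensen = (p - k) * (math.log(a1 * A1 + a2 * A2)
                        - a1 * math.log(A1) - a2 * math.log(A2))
    return amgm + jensen, amgm, jensen


# Hypothetical population eigenvalues under H_q with q = 3 and sigma^2 = 1.
lam = [6.0, 4.0, 2.5, 1.0, 1.0, 1.0]
q = 3
checks = []
for k in range(q):
    direct = W(lam, q, k)
    total, amgm, jensen = W_decomposed(lam, q, k)
    checks.append((k, direct, total, amgm, jensen))
    print(f"k={k}: W={direct:.6f}  AM-GM term={amgm:.6f}  Jensen term={jensen:.6f}")
```

Both terms are non-negative and the Jensen term is strictly positive because $A_1 > A_2$, which is what drives (3.13).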

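Now we assume $k > q$ and $k \le p-1$. Without loss of generality we can assume $\sigma^2 = 1$. By (3.7) we have $\lim_{N \to \infty} (l_j - 1) = 0$ a.s. for $j = q+1, \ldots, p$. Using Taylor's expansion, we get for $k \ge q$

$$ \begin{aligned} G_1(k) &= N\Big\{ \sum_{i=k+1}^{p} \log(1 + l_i - 1) - (p-k) \log\Big(1 + \frac{1}{p-k} \sum_{i=k+1}^{p} (l_i - 1)\Big) \Big\} \\ &= -\frac{N}{2} \sum_{i=k+1}^{p} (l_i - 1)^2 (1 + o(1)) + \frac{N}{2(p-k)} \Big( \sum_{i=k+1}^{p} (l_i - 1) \Big)^2 (1 + o(1)) \quad \text{a.s.} \end{aligned} $$

By (3.7) we see that

$$ G_1(k) = O(\log\log N) \ \text{a.s.},\ p-1 \ge k > q; \qquad G_1(q) = O(\log\log N) \ \text{a.s.} \tag{3.15} $$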
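The second-order Taylor approximation of $G_1(k)$ used above can be checked on a small example. The sketch compares the exact value of $G_1(k)$ with the quadratic approximation when the trailing eigenvalues are close to one; the eigenvalue deviations and $N$ are arbitrary assumptions chosen for illustration.

```python
import numpy as np


def G1_exact(l_tail, N):
    """N * [sum log l_i - m * log(mean l_i)] over the m trailing eigenvalues."""
    l_tail = np.asarray(l_tail, dtype=float)
    m = l_tail.size
    return N * (np.sum(np.log(l_tail)) - m * np.log(l_tail.mean()))


def G1_quadratic(l_tail, N):
    """Second-order Taylor approximation around l_i = 1."""
    d = np.asarray(l_tail, dtype=float) - 1.0
    m = d.size
    return N * (-0.5 * np.sum(d ** 2) + np.sum(d) ** 2 / (2.0 * m))


# Hypothetical trailing eigenvalues l_{k+1}, ..., l_p, all close to sigma^2 = 1.
N = 10_000
l_tail = 1.0 + np.array([0.010, -0.005, 0.002, -0.008])

exact = G1_exact(l_tail, N)
approx = G1_quadratic(l_tail, N)
rel_err = abs(exact - approx) / abs(approx)
print(f"exact={exact:.6f}  quadratic={approx:.6f}  relative error={rel_err:.2e}")
```

Since the deviations $l_i - 1$ are of order $\sqrt{\log\log N / N}$ by (3.7), the quadratic terms multiplied by $N$ are of order $\log\log N$, which is the content of (3.15).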


From (3.9), (3.15) and $C_N / \log\log N \to \infty$, we get

$$ G(q) - G(k) = C_N (k-q)(2p-k-q) + O(\log\log N) = C_N (k-q)(2p-k-q)(1 + o(1)) \quad \text{a.s.} \tag{3.16} $$

Thus with probability one for large $N$ we have

$$ G(q) > G(k). \tag{3.17} $$

From (3.14) and (3.17), it follows that with probability one for large $N$

$$ \hat q = q. $$

Thus the proof of Theorem 3.1 is completed.

When $x(t)$ is distributed as real multivariate normal, the proof goes along the same lines as in the complex case.

Wax and Kailath (1985) showed that $\mathrm{MDL}(q) - \mathrm{MDL}(k)$ is asymptotically negative with probability one for $k < q$. But, for $k > q$, the above difference is not asymptotically positive with probability one. So, the strong consistency of the MDL criterion does not follow from the arguments of Wax and Kailath (1985). But, it follows from our results by taking $C_N = \frac{1}{2} \log N$. Wax and Kailath (1985) pointed out that the AIC criterion is not consistent. Hannan and Quinn (1979) considered an information theoretic criterion to determine the order of an autoregressive process; this criterion will be discussed in a subsequent communication.


4.  DETECTION  OF  THE  NUMBER  OF  SIGNALS  WHEN 
VARIANCE  OF  WHITE  NOISE  IS  KNOWN 

In Section 3, we discussed a model selection criterion for detection of the number of signals when the distribution underlying the observations is complex multivariate normal and the variance of white noise is unknown. In this section, we derive an analogous criterion when the underlying distribution is (real) multivariate normal and the variance of the white noise is known. The strong consistency of the above criterion is also established.

In the model (2.1), we assume that the noise vector $n(t)$ is distributed as multivariate normal with mean vector $0$ and covariance matrix $\sigma^2 I_p$, $A$ is a real matrix of rank $q < p$, and the signal vector $s(t)$ is distributed independently of $n(t)$ as multivariate normal with mean vector $0$ and nonsingular covariance matrix $\Psi$. Then, the covariance matrix of $x(t)$ is $\Sigma = A \Psi A' + \sigma^2 I$. We assume that $\sigma^2$ is known. Without loss of generality, we assume that $\sigma^2 = 1$. Let $\lambda_1 \ge \cdots \ge \lambda_p$ denote the eigenvalues of $\Sigma$. Now, let

$$ \Theta_k:\ \lambda_1 \ge \cdots \ge \lambda_k > \lambda_{k+1} = \cdots = \lambda_p = 1. \tag{4.1} $$

The $k$-th model $M_k$ is the one for which $\Theta_k$ is true. We are interested in selecting one of the $p$ models $M_0, M_1, \ldots, M_{p-1}$.

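The logarithm of the likelihood function is given, apart from an additive constant, by

$$ L(\theta) = -\frac{N}{2} \log|\Sigma| - \frac{N}{2}\, \mathrm{tr}\, \Sigma^{-1} \hat\Sigma \tag{4.2} $$

where

$$ \hat\Sigma = \sum_{j=1}^{N} x_j x_j' / N. \tag{4.3} $$

Also, let $\delta_1 \ge \cdots \ge \delta_p$ be the eigenvalues of $\hat\Sigma$. In addition, let $\tau$ denote the number of $\delta_i$'s which are greater than one. Also, let $d \le \tau$. We will first calculate $L^*(\lambda_{d+1}, \ldots, \lambda_p) = \sup^* L(\theta)$, where $\sup^* L(\theta)$ indicates that $L(\theta)$ is maximized, for given $\lambda_{d+1}, \ldots, \lambda_p$, subject to the condition that $\lambda_1 \ge \cdots \ge \lambda_d > 1$.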

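Write $\Lambda = \mathrm{diag}(\lambda_1, \ldots, \lambda_p)$, $\Delta = \mathrm{diag}(\delta_1, \ldots, \delta_p)$. There exist two real orthogonal matrices $O_1$ and $O_2$ such that

$$ \Sigma = O_1 \Lambda O_1', \quad \hat\Sigma = O_2 \Delta O_2'. $$

Put $Q = O_2' O_1$. Then we have

$$ L(\theta) = -\frac{N}{2} \sum_{j=1}^{p} \log \lambda_j - \frac{N}{2}\, \mathrm{tr}\, \Lambda^{-1} Q' \Delta Q. $$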


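Since $Q$ is orthogonal, we have

$$ \mathrm{tr}\, \Lambda^{-1} Q' \Delta Q \ge \sum_{j=1}^{p} \delta_j / \lambda_j $$

and the equality holds for $Q = I_p$ (see Von Neumann (1937)). So,

$$ \sup{}^* L(\theta) = \sup{}^* \Big\{ -\frac{N}{2} \sum_{j=1}^{p} \log \lambda_j - \frac{N}{2} \sum_{j=1}^{p} (\delta_j / \lambda_j) \Big\} \tag{4.4} $$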
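The trace inequality behind (4.4) can be illustrated numerically. The sketch below draws random orthogonal matrices $Q$ and confirms that $\mathrm{tr}\, \Lambda^{-1} Q' \Delta Q$ never falls below $\sum_j \delta_j / \lambda_j$ when both sets of eigenvalues are in decreasing order, with equality at $Q = I_p$. The eigenvalues, seed and number of draws are assumptions made only for this example.

```python
import numpy as np

rng = np.random.default_rng(3)

# Hypothetical eigenvalues, both sorted in decreasing order.
lam = np.array([4.0, 2.5, 1.5, 1.0])     # eigenvalues of Sigma
delta = np.array([3.6, 2.9, 1.2, 0.8])   # eigenvalues of the sample covariance
p = lam.size

lower = np.sum(delta / lam)              # claimed minimum of the trace
Lam_inv = np.diag(1.0 / lam)
Delta = np.diag(delta)

gaps = []
for _ in range(1000):
    # Random orthogonal matrix from the QR factorisation of a Gaussian matrix.
    Q, _ = np.linalg.qr(rng.standard_normal((p, p)))
    value = np.trace(Lam_inv @ Q.T @ Delta @ Q)
    gaps.append(value - lower)

at_identity = np.trace(Lam_inv @ Delta) - lower
print("smallest gap over random Q:", min(gaps))
print("gap at Q = I:", at_identity)
```

All gaps are non-negative and the gap at the identity is zero, so the supremum over $Q$ in the log-likelihood is attained by aligning the eigenvectors of $\Sigma$ with those of $\hat\Sigma$.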


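i.e.,

$$ \begin{aligned} L^*(\lambda_{d+1}, \ldots, \lambda_p) &= -\frac{N}{2} \log(\lambda_{d+1} \cdots \lambda_p) - \frac{N}{2} \sum_{j=d+1}^{p} \delta_j / \lambda_j + \sup\Big\{ -\frac{N}{2} \log(\lambda_1 \cdots \lambda_d) - \frac{N}{2} \sum_{j=1}^{d} \delta_j / \lambda_j \Big\} \\ &= -\frac{N}{2} \log(\lambda_{d+1} \cdots \lambda_p) - \frac{N}{2} \sum_{j=d+1}^{p} \delta_j / \lambda_j - \frac{N}{2} \log(\delta_1 \cdots \delta_d) - \frac{N}{2}\, d \end{aligned} \tag{4.5} $$

where the supremum is attained at $\lambda_j = \delta_j$ for $j = 1, 2, \ldots, d$.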
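First, we assume that $\tau < k$. In this case

$$ \sup_{\theta \in \Theta_k} L(\theta) = \sup_{\Phi(\tau,k)} L^*(\lambda_{\tau+1}, \ldots, \lambda_p) \tag{4.6} $$

where $\sup_{\Phi(\tau,k)}$ indicates that the supremum is taken over $\lambda_{\tau+1} \ge \cdots \ge \lambda_k > 1$ and $\lambda_{k+1} = \cdots = \lambda_p = 1$. But

$$ \sup_{\Phi(\tau,k)} L^*(\lambda_{\tau+1}, \ldots, \lambda_p) = -\frac{N}{2} \sum_{i=1}^{\tau} \log \delta_i - \frac{N\tau}{2} - \frac{N}{2} \sum_{i=k+1}^{p} \delta_i + \frac{N}{2} \sup_{\lambda_{\tau+1} \ge \cdots \ge \lambda_k > 1} \Big\{ -\sum_{i=\tau+1}^{k} \log \lambda_i - \sum_{i=\tau+1}^{k} \delta_i / \lambda_i \Big\}. \tag{4.7} $$

Also, $\delta_i \le 1$ and $\lambda_i > 1$ for $i = \tau+1, \ldots, k$. So,

$$ -(\log \lambda_i + (\delta_i / \lambda_i)) \le -\delta_i \tag{4.8} $$

and the equality holds only when $\lambda_i = 1$. Since the above $\lambda_i$'s can be taken arbitrarily close to one, we have

$$ \sup_{\theta \in \Theta_k} L(\theta) = -\frac{N}{2} \sum_{i=1}^{\tau} \log \delta_i - \frac{N\tau}{2} - \frac{N}{2} \sum_{i=\tau+1}^{p} \delta_i \tag{4.9} $$

when $\tau < k$. Next, let $\tau \ge k$. Then

$$ \sup_{\theta \in \Theta_k} L(\theta) = -\frac{N}{2} \log(\delta_1 \cdots \delta_k) - \frac{N}{2}\, k + \sup_{\lambda_{k+1} = \cdots = \lambda_p = 1} \Big\{ -\frac{N}{2} \log(\lambda_{k+1} \cdots \lambda_p) - \frac{N}{2} \sum_{j=k+1}^{p} \delta_j / \lambda_j \Big\}. \tag{4.10} $$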


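Combining (4.9) and (4.10), we obtain

$$ \sup_{\theta \in \Theta_k} L(\theta) = -\frac{N}{2} \sum_{i=1}^{p} \log \delta_i - \frac{Np}{2} + \frac{N}{2} \sum_{i=1+\min(\tau,k)}^{p} (\log \delta_i + 1 - \delta_i). $$

But the supremum of $L(\theta)$ over the whole parametric space is given by

$$ -\frac{N}{2} \sum_{i=1}^{p} \log \delta_i - \frac{Np}{2}. \tag{4.11} $$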


So, the logarithm of the likelihood ratio test statistic for testing $\Theta_k$ is given by

$$ \tilde L_k = \frac{N}{2} \sum_{i=1+\min(\tau,k)}^{p} (\log \delta_i + 1 - \delta_i). \tag{4.12} $$

Now, let

$$ L_k = \frac{N}{2} \sum_{i=k+1}^{p} (\log \delta_i + 1 - \delta_i). \tag{4.13} $$


We know from (3.7) that

$$ \delta_i - \lambda_i = O\big((\log\log N / N)^{1/2}\big) \quad \text{a.s.} \tag{4.14} $$

Suppose the true model is $M_q$. Then

$$ \lambda_1 \ge \cdots \ge \lambda_q > \lambda_{q+1} = \cdots = \lambda_p = 1. \tag{4.15} $$

From (4.14), we know, with probability one, that $\delta_i > 1$ for $i = 1, 2, \ldots, q$ and $\min(q, \tau) = q$ for large $N$. So, the statistics $\tilde L_q$ and $L_q$ have the same distribution asymptotically. Here, we note that Anderson (1963) suggested using $L_q$ as a statistic to test $\Theta_q$ and pointed out that the asymptotic distribution of $L_q$ is chi-square with $(p-q)(p-q+1)/2$ degrees of freedom. Rao (1983) pointed out that $L_q$ is not the LRT statistic.

q 

We will now consider the problem of selecting one of the models $M_0, M_1, \ldots, M_{p-1}$ by using an information theoretic criterion. Let

$$ G(k) = L_k - C_N\, k(2p-k+1)/2 \tag{4.16} $$

where $C_N$ satisfies the following conditions:

(i) $\lim_{N \to \infty} (C_N / N) = 0$,  (ii) $\lim_{N \to \infty} (C_N / \log\log N) = \infty$.

We select the model $M_{\hat q}$ where $\hat q$ is chosen such that

$$ G(\hat q) = \max_{0 \le k \le p-1} G(k). \tag{4.17} $$

We will now show that $\hat q$ is a consistent estimate of $q$.

THEOREM 4.1. If $N\hat\Sigma$ is distributed as a central Wishart matrix with $N$ degrees of freedom and $E(\hat\Sigma) = \Sigma$, then $\hat q$ is a strongly consistent estimate of $q$.

PROOF. Suppose that $M_q$ is the true model and $k < q$. We have

$$ G(q) - G(k) = L_q - L_k - C_N (q-k)(2p-k-q+1)/2. \tag{4.18} $$

As mentioned above, with probability one, we have for large $N$,

$$ \delta_i > 1,\ i = 1, \ldots, q, \quad \text{and} \quad \min(q, \tau) = q. \tag{4.19} $$

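Thus with probability one for large $N$,

$$ L_q - L_k = \frac{N}{2} \sum_{i=q+1}^{p} (\log \delta_i + 1 - \delta_i) - \frac{N}{2} \sum_{i=k+1}^{p} (\log \delta_i + 1 - \delta_i) = -\frac{N}{2} \sum_{i=k+1}^{q} (\log \delta_i + 1 - \delta_i) = \frac{N}{2}\, W_N(q,k), $$

where

$$ W_N(q,k) = -\sum_{i=k+1}^{q} (\log \delta_i + 1 - \delta_i). $$

We have

$$ \lim_{N \to \infty} W_N(q,k) = W(q,k) = -\sum_{i=k+1}^{q} (\log \lambda_i + 1 - \lambda_i) > 0 \quad \text{a.s.} $$

Hence, with probability one, we have for large $N$,

$$ L_q - L_k > \tfrac{1}{4}\, N\, W(q,k), \qquad G(q) - G(k) > 0. \tag{4.20} $$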


Here we used the condition $\lim_{N \to \infty} C_N / N = 0$.

Now we assume that $k > q$. By (4.19) we have

$$ |L_q - L_k| \le \frac{N}{2} \sum_{i=q+1}^{p} |\log \delta_i + 1 - \delta_i|. $$

Since $|\delta_i - 1| = O\big((\log\log N / N)^{1/2}\big)$ a.s. for $i > q$, we can use Taylor's expansion to get

$$ |L_q - L_k| \le \frac{N}{2} \sum_{i=q+1}^{p} \frac{1}{2} (\delta_i - 1)^2 (1 + o(1)) = O(\log\log N) \quad \text{a.s.} $$

From $C_N / \log\log N \to \infty$, we see that with probability one, for large $N$,

$$ G(q) - G(k) = O(\log\log N) + C_N (k-q)(2p-k-q+1)/2 > 0. \tag{4.21} $$

From (4.20) and (4.21), it follows that with probability one for large $N$,

$$ \hat q = q. $$

Thus Theorem 4.1 is proved.

When  the  underlying  distribution  is  complex  multivariate  normal,  the 
proof  for  the  consistency  of  the  method  goes  along  the  same  lines  as  in  the 
real  case. 
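The known-variance selection rule of this section can be sketched as follows. The code assumes the statistic $L_k = \frac{N}{2} \sum_{i>k} (\log \delta_i + 1 - \delta_i)$ of (4.13) with $\sigma^2 = 1$ and the penalised criterion $G(k)$ of (4.16); the sample eigenvalues, sample size and the choice $C_N = \log N$ are hypothetical values used only for illustration.

```python
import math

import numpy as np


def L_stat(delta, k, N):
    """(N/2) * sum_{i>k} (log delta_i + 1 - delta_i), with sigma^2 = 1."""
    tail = np.asarray(delta, dtype=float)[k:]
    return 0.5 * N * np.sum(np.log(tail) + 1.0 - tail)


def L_lrt(delta, k, N):
    """Same sum started after min(tau, k), tau = number of delta_i > 1."""
    delta = np.asarray(delta, dtype=float)
    tau = int(np.sum(delta > 1.0))
    return L_stat(delta, min(tau, k), N)


def select_model(delta, N, C_N):
    """Return (q_hat, G) with G[k] = L_k - C_N * k(2p-k+1)/2, k = 0..p-1."""
    delta = np.sort(np.asarray(delta, dtype=float))[::-1]
    p = delta.size
    G = [L_stat(delta, k, N) - C_N * k * (2 * p - k + 1) / 2.0 for k in range(p)]
    return int(np.argmax(G)), G


# Hypothetical sample eigenvalues (noise variance normalised to one).
delta = [10.0, 5.0, 1.02, 1.0, 0.99, 0.98]
N = 1000
q_hat, G = select_model(delta, N, C_N=math.log(N))
print("selected model index:", q_hat)
```

Each summand $\log \delta_i + 1 - \delta_i$ is non-positive, so $L_k$ increases with $k$ and the penalty term is what prevents the largest model from always being chosen.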
5.  DETERMINATION  OF  THE  NUMBER  OF  SIGNALS  WHEN  THE
UNDERLYING  DISTRIBUTION  IS  ELLIPTICALLY  SYMMETRIC

In this section, we discuss procedures for determination of the number of signals transmitted when the underlying distribution is real or complex elliptically symmetric. Here, we note that a random vector $y$ is said to be elliptically symmetric if its density is of the form

$$ f(y) = |\Sigma|^{-1/2}\, g((y-\mu)' \Sigma^{-1} (y-\mu)) \tag{5.1} $$

where $g$ is a non-increasing function on $[0, \infty)$. Multivariate normal and multivariate t distributions are special cases of the elliptically symmetric distributions. Kelker (1970) proposed the elliptically symmetric distributions and studied some of their properties. Krishnaiah and Lin (1984) proposed the complex elliptically symmetric distribution and studied some of its properties. A complex random vector $x = x_1 + i x_2$ is said to be distributed as complex elliptically symmetric if its density is of the form

$$ f(x) = |\Sigma|^{-1}\, h((\bar x - \bar\mu)' \Sigma^{-1} (x - \mu)) \tag{5.2} $$

where $\Sigma$ is Hermitian, $\bar a$ denotes the complex conjugate of $a$, and $h(\cdot)$ is a non-increasing function on $[0, \infty)$. The covariance matrix of $(x_1', x_2')'$ has the structure

[the display giving this structure is missing in the source]

Complex multivariate normal considered by Wooding (1956) and Goodman (1963) and complex multivariate t distribution are special cases of the complex elliptically symmetric distribution. The density of the complex multivariate normal is known to be

$$ f(x) = \frac{1}{\pi^p |\Sigma|} \exp\{ -(\bar x - \bar\mu)' \Sigma^{-1} (x - \mu) \}. \tag{5.3} $$

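Now, consider the signal process $x(t)$ in (2.1) but assume that the joint density of $x_1 = x(t_1), \ldots, x_N = x(t_N)$ is

$$ f(x_1, \ldots, x_N) = |\Sigma|^{-N}\, h(N\, \mathrm{tr}\, \Sigma^{-1} \hat\Sigma) \tag{5.4} $$

where $N\hat\Sigma = \sum_{j=1}^{N} x_j x_j'$. Let $\lambda_1 \ge \cdots \ge \lambda_p$ be the eigenvalues of $\Sigma$ and let $l_1 \ge \cdots \ge l_p$ denote the eigenvalues of $\hat\Sigma$. Also, let $\Theta_k$ denote the model in which

$$ \lambda_1 \ge \cdots \ge \lambda_k > \lambda_{k+1} = \cdots = \lambda_p = \sigma^2 \tag{5.5} $$

where $\sigma^2$ is unknown. Let $f(x_1, \ldots, x_N \mid \Theta_k)$ denote the likelihood function under the $k$-th model $\Theta_k$. Also, let

$$ L(\Theta_k) = \log f(x_1, \ldots, x_N \mid \Theta_k) \tag{5.6} $$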

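for $k = 0, 1, \ldots, p-1$. We know that for given $\lambda_1, \ldots, \lambda_p$ the minimum of $\mathrm{tr}\, \Sigma^{-1} \hat\Sigma$ is $\sum_{j=1}^{p} \lambda_j^{-1} l_j$ (see von Neumann (1937)). So,

$$ \max_{\Theta_k} L(\Theta_k) = \max_{\Theta_k} \Big\{ -N \sum_{j=1}^{p} \log \lambda_j + \log h\Big( N \sum_{j=1}^{p} \lambda_j^{-1} l_j \Big) \Big\} \tag{5.7} $$

where the maximum is taken subject to (5.5). Suppose $h(t)$ has a continuous derivative $h'(t)$ on $[0, \infty)$ and the equation

$$ Np\, h(y) = -y\, h'(y) \tag{5.8} $$

has a unique solution $y = Np / \gamma_h$. Then, the above maximum is reached at

$$ \lambda_i = \gamma_h l_i \ (i = 1, \ldots, k), \qquad \lambda_{k+1} = \cdots = \lambda_p = \gamma_h \Big\{ \frac{1}{p-k} (l_{k+1} + \cdots + l_p) \Big\}. \tag{5.9} $$

Thus

$$ \max_{\Theta_k} L(\Theta_k) = -Np \log \gamma_h + \log h(Np / \gamma_h) - N \sum_{i=1}^{p} \log l_i + G_1(k) \tag{5.10} $$

where

$$ G_1(k) = N \log \Big\{ \prod_{i=k+1}^{p} l_i \Big/ \Big( \frac{1}{p-k} \sum_{i=k+1}^{p} l_i \Big)^{p-k} \Big\}. \tag{5.11} $$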

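Under the conditions of Theorem 3.1, we observe that, for $k \le q$,

$$ \lim_{N \to \infty} \frac{1}{N} (G_1(k) - G_1(k-1)) = W(k, k-1) > 0 \quad \text{a.s.} \tag{5.12} $$

and for $k > q$ $(k \le p-1)$

$$ \frac{1}{N} (G_1(k) - G_1(k-1)) = O\Big( \frac{\log\log N}{N} \Big) \quad \text{a.s.} \tag{5.13} $$

where

$$ W(k, k-1) = (p-k+1) \Big[ \log\Big( \frac{1}{p-k+1} (\lambda_k + \cdots + \lambda_p) \Big) - \frac{1}{p-k+1} \log \lambda_k - \frac{p-k}{p-k+1} \log\Big( \frac{1}{p-k} (\lambda_{k+1} + \cdots + \lambda_p) \Big) \Big]. \tag{5.14} $$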


We see that $G_1(k)$ is a non-decreasing function of $k$ for $k \in \{0, 1, \ldots, p-1\}$. If we draw the points $(0, G_1(0)), (1, G_1(1)), \ldots, (p-1, G_1(p-1))$ in the Cartesian coordinate plane, and construct a polygonal line with these points as its $p$ vertices, then $G_1(k) - G_1(k-1)$ is just the slope of the $k$-th segment. Suppose that $q$ is the true number of signals. For convenience we temporarily assume $q > 0$. As shown in (5.12) and (5.13), we can assert with probability one that, for large $N$,

$$ G_1(k) - G_1(k-1) > c_1 N \quad \text{for } k \le q \tag{5.15} $$

and

$$ G_1(k) - G_1(k-1) = O(\log\log N) \quad \text{for } q < k \le p-1, \tag{5.16} $$

where $c_1 > 0$ is a constant. Thus we see that the slope $G_1(k) - G_1(k-1)$ differs markedly between $k \le q$ and $q < k \le p-1$, and the true value $q$ is just the largest $k$ for which $G_1(k) - G_1(k-1) > C_N$, where $C_N$ satisfies the following conditions:

$$ \lim_{N \to \infty} (C_N / N) = 0, \qquad \lim_{N \to \infty} (C_N / \log\log N) = \infty. \tag{5.17} $$

If we put $G_1(-1) = -\infty$, then the same is true for $q = 0$. Motivated by (5.15) and (5.16), we estimate the number of signals $q$ with $\hat q$ where

$$ \hat q = \max\{ k \le p-1 :\ G_1(k) - G_1(k-1) > C_N \}. \tag{5.18} $$

Under the conditions of Theorem 3.1, we can show that $\hat q$ is a consistent estimate of $q$ by following the same lines as in Section 3.
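The slope-based estimate can be sketched as follows: compute $G_1(k)$ from the sample eigenvalues, set $G_1(-1) = -\infty$, and return the largest $k \le p-1$ whose slope $G_1(k) - G_1(k-1)$ exceeds $C_N$. The eigenvalues, $N$ and the choice $C_N = \log N$ below are hypothetical, and $G_1(k)$ is taken in the form $N \log\{\prod_{i>k} l_i / (\text{mean}_{i>k}\, l_i)^{p-k}\}$.

```python
import math

import numpy as np


def G1(l, k, N):
    """G_1(k) = N * [sum_{i>k} log l_i - (p-k) * log(mean_{i>k} l_i)]."""
    tail = np.asarray(l, dtype=float)[k:]
    return N * (np.sum(np.log(tail)) - tail.size * math.log(tail.mean()))


def estimate_q_by_slope(l, N, C_N):
    """Largest k in {0, ..., p-1} with G_1(k) - G_1(k-1) > C_N."""
    l = np.sort(np.asarray(l, dtype=float))[::-1]  # descending eigenvalues
    p = l.size
    prev = -math.inf          # convention G_1(-1) = -infinity
    q_hat = 0
    slopes = []
    for k in range(p):
        cur = G1(l, k, N)
        slope = cur - prev
        slopes.append(slope)
        if slope > C_N:
            q_hat = k         # keep the largest qualifying k
        prev = cur
    return q_hat, slopes


# Hypothetical sample eigenvalues with two dominant components.
l = [10.0, 5.0, 1.02, 1.0, 0.99, 0.98]
N = 1000
q_slope, slopes = estimate_q_by_slope(l, N, C_N=math.log(N))
print("slopes:", [round(s, 3) for s in slopes])
print("estimated number of signals:", q_slope)
```

In this example the slopes drop from order $N$ to well below $C_N$ after $k = 2$, which is the change in slope that the estimate is designed to detect.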


In general, we do not know whether the conditions of Theorem 3.1 are satisfied. In these cases, we make the following assumptions:

(i) $\lim_{N \to \infty} \frac{1}{N} (G_1(k) - G_1(k-1)) = 0$ a.s. for $k > q$

(ii) $\lim_{N \to \infty} \frac{1}{N} (G_1(k) - G_1(k-1)) = W(k, k-1) > 0$ a.s. for $k \le q$  (5.18)

where we denote $\lambda_0 = \infty$ for convenience. In this case, we need to assume that the smallest non-zero eigenvalue of $A \Psi A'$ is distinguishable from $\sigma^2$, namely, the ratio of signal intensity to that of noise can be detected by the sensor. We assume that $(\lambda_q - \sigma^2)/\sigma^2 \ge \epsilon > 0$ and $\epsilon$ is known for the given receiver. In this case, we estimate $q$ with $\hat q$ where $\hat q$ is chosen such that

$$ \hat q = \max\{ k \le p-1 :\ G_1(k) - G_1(k-1) > \tfrac{\gamma}{2} N \}, \tag{5.19} $$

where we denote $G_1(-1) = -\infty$ for convenience. Also,

$$ \gamma = \min_{0 \le k \le p-1} (p-k+1) \Big\{ \log\Big( \frac{1}{p-k+1} + \frac{p-k}{p-k+1}\, \delta \Big) - \frac{p-k}{p-k+1} \log \delta \Big\} > 0, \tag{5.20} $$

and

$$ \delta = 1 - \frac{\epsilon}{p(1+\epsilon)}. \tag{5.21} $$

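We now establish the strong consistency of $\hat q$. To prove this, we write

$$ \alpha_k = \frac{1}{p-k+1}, \qquad \beta_k = \frac{p-k}{p-k+1}, \qquad A_k = \frac{1}{(p-k)\lambda_k} (\lambda_{k+1} + \cdots + \lambda_p). $$

Suppose that $q$ is the true number of signals and $k \le q$. Then

$$ \lim_{N \to \infty} \frac{1}{N} (G_1(k) - G_1(k-1)) = W(k, k-1) \quad \text{a.s.} \tag{5.22} $$

and

$$ W(k, k-1) = (p-k+1) \{ \log(\alpha_k + \beta_k A_k) - \beta_k \log A_k \} > 0. $$

Consider $f_k(x) = \log(\alpha_k + \beta_k x) - \beta_k \log x$ for $x \in (0, 1]$. We have

$$ f_k'(x) = -\alpha_k \beta_k (1-x) / \{ x (\alpha_k + \beta_k x) \} \le 0, \quad 0 < x \le 1, $$

so that $f_k(x)$ is a decreasing function on $(0, 1]$. But if $\lambda_q \ge (1+\epsilon)\sigma^2$, then for $0 \le k \le q-1$,

$$ A_k \le 1 - \frac{\epsilon}{(p-k)(1+\epsilon)} \le 1 - \frac{\epsilon}{p(1+\epsilon)} = \delta $$

and

$$ A_q = \sigma^2 / \lambda_q \le \frac{1}{1+\epsilon} \le \delta. $$

Thus for $0 \le k \le q$,

$$ W(k, k-1) \ge \gamma. \tag{5.23} $$

From (5.22) and (5.23), it follows that, with probability one for large $N$,

$$ G_1(k) - G_1(k-1) > \tfrac{\gamma}{2} N, \quad k \le q. \tag{5.24} $$

On the other hand, if $q < k \le p-1$,

$$ \lim_{N \to \infty} \frac{1}{N} (G_1(k) - G_1(k-1)) = 0 \quad \text{a.s.} \tag{5.25} $$

So with probability one, for large $N$,

$$ G_1(k) - G_1(k-1) < \tfrac{\gamma}{2} N \quad \text{for } q < k \le p-1. \tag{5.26} $$

From (5.24) and (5.26), it follows that $\hat q = q$ with probability one for large $N$.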

REFERENCES 


Akaike, H. (1972). Information theory and an extension of the maximum likelihood principle. In Proceedings of the Second International Symposium on Information Theory, Supp. to Problems of Control and Information Theory, pp. 267-281.

Anderson, T.W. (1963). Asymptotic theory for principal component analysis. Annals of Mathematical Statistics, 34, 122-148.

Bai, Z.D. (1984). A note on asymptotic joint distribution of the eigenvalues of a noncentral multivariate F matrix. Tech. Report No. 84-49. Center for Multivariate Analysis, University of Pittsburgh.

Bartlett, M.S. (1954). A note on the multiplying factors for various $\chi^2$ approximations. Journal of the Royal Statistical Society, Ser. B, 16, 296-298.

Goodman, N.R. (1963). Statistical analysis based on a certain multivariate complex Gaussian distribution (an introduction). Annals of Mathematical Statistics, 34, 152-176.

Hannan, E.J. and Quinn, B.G. (1979). The determination of the order of an autoregression. Journal of the Royal Statistical Society, Ser. B, 41, 190-195.

Kelker, D. (1970). Distribution theory of spherical distribution and a location scale parameter generalization. Sankhya, Ser. A, 43, 419-430.

Krishnaiah, P.R. (1976). Some recent developments on complex multivariate distributions. Journal of Multivariate Analysis, 6, 1-30.

Krishnaiah, P.R. and Lin, J. (1984). Complex elliptical distributions. Technical Report No. 84-19. Center for Multivariate Analysis, University of Pittsburgh.

Krishnaiah, P.R. and Schuurmann, F.J. Computations of Complex Multivariate Distributions (in preparation). To be published by North-Holland Publishing Company.

Krishnaiah, P.R. and Waikar, V.B. (1971). Simultaneous tests for equality of latent roots against certain alternatives, I. Annals of the Institute of Statistical Mathematics, 23, 451-468.

Krishnaiah, P.R. and Waikar, V.B. (1972). Simultaneous tests for equality of latent roots against certain alternatives, II. Annals of the Institute of Statistical Mathematics, 24, 81-85.

Liggett, W.S. (1973). Passive sonar: fitting models to multiple time series. In Signal Processing (J.W. Griffith et al., editors), pp. 327-345. Academic Press, New York.

Rao, C.R. (1983). Likelihood ratio tests for relationships between two covariance matrices. In Studies in Econometrics, Time Series and Multivariate Statistics (T. Amemiya, S. Karlin and L. Goodman, editors). Academic Press, Inc., New York.

Reznik, M.K. (1968). The law of the iterated logarithm for some classes of stationary processes. Theory of Probability and Its Applications, 8, 606-621.

Rissanen, J. (1978). Modeling by shortest data description. Automatica, 14, 465-471.

Schmidt, R.O. (1979). Multiple emitter location and signal parameter estimation. In Proc. RADC Spectrum Estimation Workshop, Rome.

Schwartz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461-464.

Stout, W.F. (1974). Almost Sure Convergence. Academic Press, New York.

Von Neumann, J. (1937). Some matrix-inequalities and metrization of metric space. Tomsk Univ. Rev., 1, 286-300.

Wax, M. and Kailath, T. (1984). Determination of the number of signals by information theoretic criteria. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-33

Wax, M., Shan, T.J., and Kailath, T. (1984). Spatio-temporal spectral analysis by eigenstructure methods. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-32, 817-827.

Whalen, A.D. (1971). Detection of Signals in Noise. Academic Press, New York.

Wooding, R.A. (1956). The multivariate distribution of complex normal variables. Biometrika, 43, 212-215.

REPORT DOCUMENTATION PAGE

Title: On detection of number of signals in presence of white noise

Type of report and period covered: Technical - October 1985

Authors: L. C. Zhao, P. R. Krishnaiah, and Z. D. Bai

Performing organization: Center for Multivariate Analysis, 515 Thackeray Hall, University of Pittsburgh, Pittsburgh, PA 15260

Report date: October 1985

Security classification: Unclassified

Distribution statement: Approved for public release; distribution unlimited

Key words: Elliptically symmetric distribution, information theory, model selection, principal component analysis, signal detection and white noise.

Abstract: In this paper, the authors propose procedures for the detection of the number of signals in presence of Gaussian white noise. The methods used fall within the framework of the model selection procedures using an information theoretic criterion. The strong consistency of the estimates of the number of signals, under different situations, is established. Extensions of the results to the case when the noise is not necessarily Gaussian are also discussed.

